Skip to content

Add learner tests for SAC and TD3 agents#369

Open
irhyl wants to merge 1 commit into
google-deepmind:masterfrom
irhyl:master
Open

Add learner tests for SAC and TD3 agents#369
irhyl wants to merge 1 commit into
google-deepmind:masterfrom
irhyl:master

Conversation

@irhyl

@irhyl irhyl commented Jun 29, 2026

Copy link
Copy Markdown

SAC and TD3 are two of the most commonly used continuous control agents in Acme, but neither has a learner test. This adds parameterized integration tests for both.

The SAC test covers adaptive and fixed entropy coefficient modes, since those take different code paths through SACLearner (the adaptive case initializes and updates a separate alpha optimizer). The TD3 test covers standard TD3 and the TD3+BC variant (bc_alpha set), since that changes the policy loss computation.

Both follow the same pattern as the existing tests in crr/ and cql/: fake continuous environment, fake transition iterator, small hidden layers to keep runtime low.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant